Benchmarking vision kernels and neural network inference accelerators on embedded platforms
نویسندگان
چکیده
Developing efficient embedded vision applications requires exploring various algorithmic optimization trade-offs and a broad spectrum of hardware architecture choices. This makes navigating the solution space finding design points with optimal performance challenge for developers. To help provide fair baseline comparison, we conducted comprehensive benchmarks accuracy, run-time, energy efficiency wide range kernels neural networks on multiple platforms: ARM57 CPU, Nvidia Jetson TX2 GPU Xilinx ZCU102 FPGA. Each platform utilizes their optimized libraries (OpenCV, VisionWorks xfOpenCV) (OpenCV DNN, TensorRT DPU). For kernels, our results show that achieves an energy/frame reduction ratio 1.1–3.2× compared to others simple kernels. However, more complicated complete pipelines, FPGA outperforms ratios 1.2–22.3×. [Inception-v2 ResNet-50, ResNet-18, Mobilenet-v2 SqueezeNet], it shows speed up [2.5, 2.1, 2.6, 2.9 2.5]× EDP [1.5, 1.1, 1.4, 2.4 1.7]× FP16 implementations, respectively.
منابع مشابه
Benchmarking Methodology for Embedded Scalable Platforms
Embedded scalable platforms (ESP) are a novel generation of platform architectures that yield optimal energy-performance operations while supporting a diversity of embedded application workloads. A companion methodology combines full-system simulation, pre-designed HW/SW interface libraries, high-level synthesis and FPGA prototyping to enable an effective design-space exploration which is drive...
متن کاملA Preliminarily Study on Embedded Platforms for Computer Vision Applications
Today, silicon chips have become more and more powerful despite reduction in size. Various general processors, digital signal processors (DSPs) and graphic processing units (GPUs) have been embedded in countless electronic gadgets such as Personal Digital Assistants (PDAs), mobile hand-phone, digital cameras etc. With the high processing power of these embedded platforms and ever increasing siz...
متن کاملBenchmarking Probabilistic Neural Network Algorithms
The progress of research in probabilistic neural network (PNN) and related issues is straight related to directly compare the performance of different PNN algorithm versions. In most cases, the PNN application in real life issues involves the classical activation function (Parzen-Cacoulos estimator) only. Although this estimator has been used in most experimental works so far, it is not the onl...
متن کاملBenchmarking CPUs and GPUs on Embedded Platforms for Software Receiver Usage
Smartphones containing multi-core central processing units (CPUs) and powerful many-core graphics processing units (GPUs) bring supercomputing technology into your pocket (or into our embedded devices). This can be exploited to produce power-efficient, customized receivers with flexible correlation schemes and more advanced positioning techniques. For example, promising techniques such as the D...
متن کاملDesign, Development and Evaluation of an Orange Sorter Based on Machine Vision and Artificial Neural Network Techniques
ABSTRACT- The high production of orange fruit in Iran calls for quality sorting of this product as a requirement for entering global markets. This study was devoted to the development of an automatic fruit sorter based on size. The hardware consisted of two units. An image acquisition apparatus equipped with a camera, a robotic arm and controller circuits. The second unit consisted of a robotic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Systems Architecture
سال: 2021
ISSN: ['1383-7621', '1873-6165']
DOI: https://doi.org/10.1016/j.sysarc.2020.101896